Xtended Bic Criterion for Model Selection
نویسندگان
چکیده
Model selection is commonly based on some variation of the BIC or minimum message length criteria, such as MML and MDL. In either case the criterion is split into two terms: one for the model (data code length/model complexity) and one for the data given the model (message length/data likelihood). For problems such as change detection, unsupervised segmentation or data clustering it is common practice for the model term to comprise only a sum of sub-model terms. In this paper it is shown that the full model complexity must also take into account the number of sub models and the labels which assign data to each sub model. From this analysis we derive an extended BIC approach (EBIC) for this class of problem. Results with artificial data are given to illustrate the properties of this procedure.
منابع مشابه
Model Selection for Mixtures of Factor Analyzers via Hierarchical BIC
Bayesian information criterion (BIC) is a common model selection criterion for mixtures of factor analyzers (MFA). However, it is found that BIC penalizes each factor analyzer implausibly using the whole sample size. In this paper, we propose a new criterion for MFA called hierarchical BIC (H-BIC). Formally, the main difference from BIC is that H-BIC penalizes each factor analyzer using its own...
متن کاملSpeaker segmentation using the MAP-adapted Bayesian information criterion
The Bayesian information criterion (BIC) is a model selection criterion that has previously been applied to speaker segmentation of broadcast news by several researchers. The BIC approach treats speaker segmentation as a model selection problem. As the BIC requires the estimation of the sample covariance matrix, its performance tends to deteriorate as the speaker-turn duration decreases. It is ...
متن کاملGeometric BIC
The author introduced the “geometric AIC” and the “geometric MDL” as model selection criteria for geometric fitting problems. These correspond to Akaike’s “AIC” and Rissanen’s “BIC”, respectively, well known in the statistical estimation framework. Another criterion well known is Schwarz’ “BIC”, but its counterpart for geometric fitting has been unknown. This paper introduces the corresponding ...
متن کاملBayes Factors and BIC Comment on “ A Critique of the Bayesian Information Criterion for Model Selection ”
I would like to thank David L. Weakliem (1999 [this issue]) for a thought-provoking discussion of the basis of the Bayesian information criterion (BIC). We may be in closer agreement than one might think from reading his article. When writing about Bayesian model selection for social researchers, I focused on the BIC approximation on the grounds that it is easily implemented and often reasonabl...
متن کاملA Novel Bayesian Cluster Enumeration Criterion for Unsupervised Learning
The Bayesian Information Criterion (BIC) has been widely used for estimating the number of data clusters in an observed data set for decades. The original derivation, referred to as classic BIC, does not include information about the specific model selection problem at hand, which renders it generic. However, very little effort has been made to check its appropriateness for cluster analysis. In...
متن کامل